Picture for Bin Qin

Bin Qin

MiCU: End-to-End Smart Home Command Understanding with Large Language Model

Add code
May 31, 2026
Viaarxiv icon

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Add code
May 25, 2026
Viaarxiv icon

OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering

Add code
Apr 09, 2026
Viaarxiv icon

All-in-One Image Restoration via Causal-Deconfounding Wavelet-Disentangled Prompt Network

Add code
Mar 04, 2026
Viaarxiv icon

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Add code
Feb 09, 2026
Viaarxiv icon

GAIA: A Data Flywheel System for Training GUI Test-Time Scaling Critic Models

Add code
Jan 26, 2026
Viaarxiv icon

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

Add code
Jan 15, 2026
Viaarxiv icon

MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus

Add code
Jan 14, 2026
Viaarxiv icon

IMSE: Efficient U-Net-based Speech Enhancement using Inception Depthwise Convolution and Amplitude-Aware Linear Attention

Add code
Nov 18, 2025
Viaarxiv icon

DevPiolt: Operation Recommendation for IoT Devices at Xiaomi Home

Add code
Nov 18, 2025
Figure 1 for DevPiolt: Operation Recommendation for IoT Devices at Xiaomi Home
Figure 2 for DevPiolt: Operation Recommendation for IoT Devices at Xiaomi Home
Figure 3 for DevPiolt: Operation Recommendation for IoT Devices at Xiaomi Home
Figure 4 for DevPiolt: Operation Recommendation for IoT Devices at Xiaomi Home
Viaarxiv icon